Abstract. This note provides a succinct proof of a 1973 theorem of Lieb that establishes the
concavity of a certain trace function. The development relies on a deep result from quantum 
information theory, the joint convexity of quantum relative entropy, as well as a recent argument 
due to Carlen and Lieb. 

1. Introduction

In his 1973 paper on trace functions, Lieb establishes an important concavity theorem [Lie73,
Thm. 6] concerning the trace exponential.

Theorem 1 (Lieb). Let H be a fixed self-adjoint matrix. The map 

$$ A \longmapsto \operatorname{tr} \exp(H + \log A) \tag{1} $$

is concave on the positive-definite cone.

The most direct proof of Theorem 1 is due to Epstein [Eps73]; see Ruskai's papers [Rus02, Rus05]
for a condensed version of this argument. Lieb's original proof develops the concavity of the
function (1) as a corollary of another deep concavity theorem [Lie73, Thm. 1]. In fact, many
convexity and concavity theorems for trace functions are equivalent to each other, in the sense
that the mutual implications follow from relatively easy arguments. See [Lie73, §5] and [CL08, §5]
for discussion of this point.

The goal of this note is to demonstrate that a modicum of convex analysis allows us to derive
Theorem 1 directly from another major theorem, the joint convexity of the quantum relative
entropy. The literature contains several elegant, conceptual proofs of the latter result; for example,
see [Eff09]. These arguments now deliver Lieb's theorem as an easy corollary.

The author's interest in Theorem 1 stems from its striking applications in random matrix theory;
refer to the paper [Tro10] for a detailed discussion. Researchers concerned with these developments
may find the current approach to Lieb's theorem more transparent than earlier treatments.

The main ideas in our presentation are drawn from the work of Carlen and Lieb [CL08], so
this dispatch does not contain a truly novel technique. Nevertheless, this note should be valuable
because it provides a geometric intuition for Theorem 1 and connects it to another major result.

2. Background 

Our argument rests on the properties of a function, called the quantum relative entropy, which 
can be interpreted as a measure of dissimilarity between two positive-definite matrices. 

Definition 2. Let X, Y be positive-definite matrices. The quantum relative entropy of X with
respect to Y is defined as

$$ D(X; Y) := \operatorname{tr}\bigl( X \log X - X \log Y - (X - Y) \bigr). $$

Other appellations for this function include quantum information divergence and von Neumann 
divergence. 
The quantum relative entropy has a nice geometric interpretation [DT07, §2.2 and §2.6]. Define
the quantum entropy function $\varphi(X) := \operatorname{tr}(X \log X)$ for a positive-definite argument. The divergence
$D(X; Y)$ can be viewed as the difference between $\varphi(X)$ and the best affine approximation
of the entropy $\varphi$ at the matrix Y. That is,

$$ D(X; Y) = \varphi(X) - \bigl[ \varphi(Y) + \langle \nabla\varphi(Y),\, X - Y \rangle \bigr]. $$
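
To see that this expression agrees with Definition 2, one can carry out the expansion explicitly. The sketch below assumes the trace inner product $\langle A, B \rangle = \operatorname{tr}(AB)$ on self-adjoint matrices and the standard gradient formula $\nabla\varphi(Y) = \log Y + I$; neither is stated in the text above, so both should be read as assumptions of this check.

```latex
\begin{align*}
\varphi(X) - \varphi(Y) - \langle \log Y + I,\; X - Y \rangle
  &= \operatorname{tr}(X \log X) - \operatorname{tr}(Y \log Y)
     - \operatorname{tr}\bigl( (\log Y)(X - Y) \bigr) - \operatorname{tr}(X - Y) \\
  &= \operatorname{tr}(X \log X) - \operatorname{tr}(X \log Y) - \operatorname{tr}(X - Y) \\
  &= D(X; Y).
\end{align*}
```

The second line uses the cyclicity of the trace, so that the two copies of $\operatorname{tr}(Y \log Y)$ cancel.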

The entropy $\varphi$ is a strictly convex function, which implies that the affine approximation strictly
underestimates $\varphi$. This observation yields the following result.

Fact 3. The quantum relative entropy is nonnegative: 

$$ D(X; Y) \ge 0. $$

Equality holds if and only if X = Y.
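
A small numerical illustration of Fact 3 may be helpful. The sketch below is not part of the original argument: it restricts attention to real symmetric 2x2 matrices so that matrix functions can be evaluated in closed form with the standard library alone, and the helper names `sym_fun`, `tr_prod`, and `rel_entropy` are hypothetical.

```python
import math

def sym_fun(M, f):
    """Apply f to the eigenvalues of a real symmetric 2x2 matrix M."""
    (a, b), (_, c) = M
    mid, rad = (a + c) / 2, math.hypot((a - c) / 2, b)
    if rad == 0.0:  # M is a multiple of the identity
        return [[f(a), 0.0], [0.0, f(a)]]
    hi, lo = f(mid + rad), f(mid - rad)
    # f(M) = s*M + r*I, where the line s*t + r matches f at both eigenvalues
    s = (hi - lo) / (2 * rad)
    r = hi - s * (mid + rad)
    return [[s * a + r, s * b], [s * b, s * c + r]]

def tr(M):
    """Trace of a 2x2 matrix."""
    return M[0][0] + M[1][1]

def tr_prod(P, Q):
    """Trace of the product P Q of two 2x2 matrices."""
    return sum(P[i][j] * Q[j][i] for i in range(2) for j in range(2))

def rel_entropy(X, Y):
    """D(X; Y) = tr(X log X - X log Y - (X - Y)) for positive-definite X, Y."""
    log_x, log_y = sym_fun(X, math.log), sym_fun(Y, math.log)
    return tr_prod(X, log_x) - tr_prod(X, log_y) - (tr(X) - tr(Y))

# Two positive-definite matrices that do not commute (illustrative values).
X = [[2.0, 1.0], [1.0, 3.0]]
Y = [[1.0, -0.5], [-0.5, 2.0]]

d_xy = rel_entropy(X, Y)  # strictly positive, since X != Y
d_xx = rel_entropy(X, X)  # zero, the equality case of Fact 3
```

The closed form used in `sym_fun` relies on the fact that a function of a 2x2 matrix with distinct eigenvalues equals the linear interpolant of that function through the two eigenvalues, evaluated at the matrix.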

In quantum statistical mechanics, Fact 3 is usually called Klein's inequality. Another proof proceeds
by showing that certain functional relations for scalars extend to matrix trace functions [Pet94, §2].

The convexity properties of quantum relative entropy have paramount importance. We require
a major theorem, due to Lindblad [Lin74, Lem. 2], which encapsulates the difficulties of the proof.

Fact 4 (Lindblad). The quantum relative entropy is a jointly convex function. That is, 

$$ D\bigl( tX_1 + (1-t)X_2;\; tY_1 + (1-t)Y_2 \bigr) \le t \cdot D(X_1; Y_1) + (1-t) \cdot D(X_2; Y_2) \quad \text{for } t \in [0, 1], $$

where $X_i$ and $Y_i$ are positive definite for $i = 1, 2$.

Fact 4 follows easily from Lieb's main concavity theorem [Lie73, Thm. 1]; Bhatia's book [Bha97,
§IX.6 and Prob. IX.8.17] offers a clear account of this approach. The literature contains several other
elegant proofs; see the papers [Uhl77, PW78, And79, Han06]. We single out Effros' work [Eff09]
because it is accessible to researchers with experience in matrix theory and convex analysis.
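
Fact 4 can be spot-checked numerically, which is of course no substitute for a proof. The sketch below assumes real symmetric 2x2 matrices (so that the standard library suffices) and draws random positive-definite test matrices with a fixed seed; the helpers `sym_fun`, `rel_entropy`, `random_pd`, and `mix` are hypothetical names introduced for this illustration.

```python
import math
import random

def sym_fun(M, f):
    """Apply f to the eigenvalues of a real symmetric 2x2 matrix M."""
    (a, b), (_, c) = M
    mid, rad = (a + c) / 2, math.hypot((a - c) / 2, b)
    if rad == 0.0:  # M is a multiple of the identity
        return [[f(a), 0.0], [0.0, f(a)]]
    hi, lo = f(mid + rad), f(mid - rad)
    s = (hi - lo) / (2 * rad)          # slope of the interpolating line
    r = hi - s * (mid + rad)           # intercept of the interpolating line
    return [[s * a + r, s * b], [s * b, s * c + r]]

def tr_prod(P, Q):
    """Trace of the product P Q of two 2x2 matrices."""
    return sum(P[i][j] * Q[j][i] for i in range(2) for j in range(2))

def rel_entropy(X, Y):
    """D(X; Y) = tr(X log X - X log Y - (X - Y))."""
    log_x, log_y = sym_fun(X, math.log), sym_fun(Y, math.log)
    return (tr_prod(X, log_x) - tr_prod(X, log_y)
            - (X[0][0] + X[1][1]) + (Y[0][0] + Y[1][1]))

def random_pd(rng):
    """Random positive-definite matrix of the form G G^T + 0.1 I."""
    g = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
    a = g[0][0] ** 2 + g[0][1] ** 2 + 0.1
    b = g[0][0] * g[1][0] + g[0][1] * g[1][1]
    c = g[1][0] ** 2 + g[1][1] ** 2 + 0.1
    return [[a, b], [b, c]]

def mix(t, P, Q):
    """Convex combination t*P + (1 - t)*Q."""
    return [[t * P[i][j] + (1 - t) * Q[i][j] for j in range(2)] for i in range(2)]

rng = random.Random(0)
worst_gap = float("inf")  # smallest observed value of (right side - left side)
for _ in range(200):
    X1, X2, Y1, Y2 = (random_pd(rng) for _ in range(4))
    t = rng.random()
    lhs = rel_entropy(mix(t, X1, X2), mix(t, Y1, Y2))
    rhs = t * rel_entropy(X1, Y1) + (1 - t) * rel_entropy(X2, Y2)
    worst_gap = min(worst_gap, rhs - lhs)
```

If the inequality of Fact 4 holds on every trial, `worst_gap` is nonnegative up to rounding error.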

Our final tool is a basic result from convex analysis which ensures that partial maximization of 
a concave function produces a concave function [CL08, Lem. 2.3]. We include the simple proof.

Proposition 5. Let $f(\cdot\,; \cdot)$ be a jointly concave function. Then the function $y \mapsto \max_x f(x; y)$
obtained by partial maximization is concave, assuming the maximum is always attained.

Proof. For each pair of points $y_1$ and $y_2$, there are points $x_1$ and $x_2$ that satisfy

$$ f(x_1; y_1) = \max_x f(x; y_1) \quad \text{and} \quad f(x_2; y_2) = \max_x f(x; y_2). $$

For each $t \in [0, 1]$, the joint concavity of $f$ implies that

$$ \begin{aligned}
\max_x f(x;\, ty_1 + (1-t)y_2) &\ge f(tx_1 + (1-t)x_2;\; ty_1 + (1-t)y_2) \\
&\ge t \cdot f(x_1; y_1) + (1-t) \cdot f(x_2; y_2) \\
&= t \cdot \max_x f(x; y_1) + (1-t) \cdot \max_x f(x; y_2).
\end{aligned} $$

In words, the partial maximum is a concave function. □
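
A scalar example may help fix ideas. The function below is chosen purely for illustration and plays no role in the argument that follows.

```latex
\begin{align*}
f(x; y) &= -x^2 + 2xy - 2y^2,
  \qquad
  \nabla^2 f = \begin{bmatrix} -2 & 2 \\ 2 & -4 \end{bmatrix}
  \ \text{is negative definite, so $f$ is jointly concave;} \\
\partial_x f(x; y) &= -2x + 2y = 0
  \quad \Longrightarrow \quad x = y; \\
\max_x f(x; y) &= f(y; y) = -y^2,
  \ \text{which is a concave function of $y$.}
\end{align*}
```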

3. Proof of Lieb's Theorem 

We begin with a variational representation of the trace, which is a restatement of the fact that 
quantum relative entropy is nonnegative. The symbol $\succ$ denotes the positive-definite order.

Lemma 6 (Variational Formula for Trace). Let Y be a positive-definite matrix. Then

$$ \operatorname{tr} Y = \max_{X \succ 0} \operatorname{tr}( X \log Y - X \log X + X ). $$

Proof. Introduce the definition of the quantum relative entropy into Fact 3 and rearrange to reach

$$ \operatorname{tr} Y \ge \operatorname{tr}( X \log Y - X \log X + X ). $$

When X = Y, both sides are equal, which yields the advertised result. □
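
The variational formula of Lemma 6 lends itself to a quick numerical sanity check. The sketch below assumes real symmetric 2x2 matrices and uses only the standard library; `sym_fun`, `objective`, and `random_pd` are hypothetical helper names, and the matrix `Y` is an arbitrary illustrative choice.

```python
import math
import random

def sym_fun(M, f):
    """Apply f to the eigenvalues of a real symmetric 2x2 matrix M."""
    (a, b), (_, c) = M
    mid, rad = (a + c) / 2, math.hypot((a - c) / 2, b)
    if rad == 0.0:  # M is a multiple of the identity
        return [[f(a), 0.0], [0.0, f(a)]]
    hi, lo = f(mid + rad), f(mid - rad)
    s = (hi - lo) / (2 * rad)          # slope of the interpolating line
    r = hi - s * (mid + rad)           # intercept of the interpolating line
    return [[s * a + r, s * b], [s * b, s * c + r]]

def tr_prod(P, Q):
    """Trace of the product P Q of two 2x2 matrices."""
    return sum(P[i][j] * Q[j][i] for i in range(2) for j in range(2))

def objective(X, Y):
    """The quantity tr(X log Y - X log X + X) that Lemma 6 maximizes over X."""
    log_x, log_y = sym_fun(X, math.log), sym_fun(Y, math.log)
    return tr_prod(X, log_y) - tr_prod(X, log_x) + X[0][0] + X[1][1]

def random_pd(rng):
    """Random positive-definite matrix of the form G G^T + 0.1 I."""
    g = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
    a = g[0][0] ** 2 + g[0][1] ** 2 + 0.1
    b = g[0][0] * g[1][0] + g[0][1] * g[1][1]
    c = g[1][0] ** 2 + g[1][1] ** 2 + 0.1
    return [[a, b], [b, c]]

Y = [[2.0, 0.5], [0.5, 1.0]]
tr_y = Y[0][0] + Y[1][1]

rng = random.Random(1)
# tr Y minus the objective at random trial points; each entry equals D(X; Y).
slack = [tr_y - objective(random_pd(rng), Y) for _ in range(200)]
at_max = objective(Y, Y)  # the objective evaluated at the maximizer X = Y
```

Every entry of `slack` should be nonnegative, and `at_max` should coincide with `tr_y`, mirroring the two steps of the proof.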
The main result follows quickly using the variational formula and the other tools we have
assembled. As noted, the structure of this argument is parallel with the approach of Carlen-Lieb to
another concavity theorem [CL08, Thm. 1.1].

Proof of Theorem 1. In the variational formula, Lemma 6, select $Y = \exp(H + \log A)$ to obtain

$$ \operatorname{tr} \exp(H + \log A) = \max_{X \succ 0} \operatorname{tr}\bigl( X(H + \log A) - X \log X + X \bigr). $$

The latter expression can be written compactly using the quantum relative entropy:

$$ \operatorname{tr} \exp(H + \log A) = \max_{X \succ 0} \bigl[ \operatorname{tr}(XH) - ( D(X; A) - \operatorname{tr} A ) \bigr]. \tag{2} $$

For each self-adjoint matrix H, Fact 4 implies that the bracket is a jointly concave function of the
variables A and X. It follows from Proposition 5 that the right-hand side of (2) defines a concave
function of A. This observation establishes the theorem. □
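
The conclusion of Theorem 1 can also be probed numerically. The sketch below is only an illustration under stated assumptions: it works with real symmetric 2x2 matrices, fixes an arbitrary symmetric matrix `H`, and tests the concavity inequality along random segments in the positive-definite cone. The helper names `sym_fun`, `lieb`, `random_pd`, and `mix` are hypothetical.

```python
import math
import random

def sym_fun(M, f):
    """Apply f to the eigenvalues of a real symmetric 2x2 matrix M."""
    (a, b), (_, c) = M
    mid, rad = (a + c) / 2, math.hypot((a - c) / 2, b)
    if rad == 0.0:  # M is a multiple of the identity
        return [[f(a), 0.0], [0.0, f(a)]]
    hi, lo = f(mid + rad), f(mid - rad)
    s = (hi - lo) / (2 * rad)          # slope of the interpolating line
    r = hi - s * (mid + rad)           # intercept of the interpolating line
    return [[s * a + r, s * b], [s * b, s * c + r]]

def lieb(A, H):
    """The trace function A -> tr exp(H + log A) from Theorem 1."""
    log_a = sym_fun(A, math.log)
    S = [[H[i][j] + log_a[i][j] for j in range(2)] for i in range(2)]
    E = sym_fun(S, math.exp)
    return E[0][0] + E[1][1]

def random_pd(rng):
    """Random positive-definite matrix of the form G G^T + 0.1 I."""
    g = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
    a = g[0][0] ** 2 + g[0][1] ** 2 + 0.1
    b = g[0][0] * g[1][0] + g[0][1] * g[1][1]
    c = g[1][0] ** 2 + g[1][1] ** 2 + 0.1
    return [[a, b], [b, c]]

def mix(t, P, Q):
    """Convex combination t*P + (1 - t)*Q."""
    return [[t * P[i][j] + (1 - t) * Q[i][j] for j in range(2)] for i in range(2)]

H = [[0.3, -1.0], [-1.0, -0.7]]  # fixed symmetric matrix (illustrative values)

rng = random.Random(2)
worst_gap = float("inf")  # smallest observed value of (left side - right side)
for _ in range(200):
    A1, A2 = random_pd(rng), random_pd(rng)
    t = rng.random()
    lhs = lieb(mix(t, A1, A2), H)
    rhs = t * lieb(A1, H) + (1 - t) * lieb(A2, H)
    worst_gap = min(worst_gap, lhs - rhs)
```

Concavity predicts that `worst_gap` is nonnegative up to rounding error. When `H` is the zero matrix, `lieb(A, H)` reduces to the trace of `A`, which gives an easy consistency check on the helper functions.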

Remark 7. The expression (2) states that the function $f : H \mapsto \operatorname{tr} \exp(H + \log A)$ is the Fenchel
conjugate of $D(\,\cdot\,; A) - \operatorname{tr} A$. This observation implies that $f$ is convex.